Abstract

We consider generalizations of depolarizing channels to maps of the form $\Phi(\rho) = \sum_k a_k V_k \rho V_k^\dagger + (1-a)(\mathrm{Tr}\,\rho)\frac{1}{d}I$ with $V_k$ unitary and $\sum_k a_k = a < 1$. We show that one can construct unital channels of this type for which the input which achieves maximal output purity is unique. We give conditions on $V_k$ under which multiplicativity of the maximal p-norm and additivity of the minimal output entropy can be proved for $\Phi \otimes \Omega$ with $\Omega$ arbitrary. We also show that the Holevo capacity need not equal $\log d - S_{\min}(\Phi)$ as one might expect for a convex combination of unitary conjugations.
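1 Introduction

The depolarizing channel has the form

$$\Gamma_a^{\rm dep}(\rho) = a\rho + (1-a)(\mathrm{Tr}\,\rho)\,\tfrac{1}{d}I \qquad (1)$$

with $-\frac{1}{d^2-1} \le a \le 1$. In this paper, we consider channels of the more general form

$$\Phi(\rho) = \sum_k a_k V_k \rho V_k^\dagger + (1-a)(\mathrm{Tr}\,\rho)\,\tfrac{1}{d}I \qquad (2)$$

with $0 < a_k$, $0 < a = \sum_k a_k < 1$ and $V_k$ unitary.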

We describe and study several subclasses of these channels (2), showing that they can exhibit different types of behavior. Those with simultaneously diagonal $V_k$ have a high level of symmetry and much in common with depolarizing channels. However, we also construct asymmetric channels with a unique state of minimal output entropy and other behavior more typical of non-unital channels; although additivity can be proved for the minimal output entropy, this does not imply additivity of the capacity because the optimal average output is not $\frac{1}{d}I$.

This paper is organized as follows. Section 2 contains some terminology and notation as well as considerable background material on various types of channels and their behavior. In Section 3 we state and prove some theorems about minimal output purity for the channels we consider. In Section 4 we consider a special subclass of channels which satisfy (13) and exhibit behavior similar to unital qubit channels. In Section 5, which is the heart of the paper, we describe several types of asymmetric channels to which our results can be applied. In Section 6 we report the results of numerical tests on channel capacity.

2 Background 

2.1 General notation and terminology 

We restrict attention to finite dimensional spaces $\mathbf{C}^d$ and denote the space of $d \times d$ complex matrices as $M_d = \mathcal{B}(\mathbf{C}^d)$. By a channel $\Phi$ we mean a completely positive, trace preserving (CPT) map $\Phi : M_d \mapsto M_d$. Let $\mathcal{D} = \{\rho : \rho \ge 0, \mathrm{Tr}\,\rho = 1\}$ denote the set of density matrices in $M_d$. Let $S(\gamma) = -\mathrm{Tr}\,\gamma \log \gamma$ denote the quantum entropy of a state $\gamma \in \mathcal{D}$. For a CPT map $\Phi$, one can define the maximal output p-norm

$$\nu_p(\Phi) = \sup_{\gamma \in \mathcal{D}} \|\Phi(\gamma)\|_p \qquad (3)$$

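the minimal output entropy

$$S_{\min}(\Phi) = \inf_{\rho \in \mathcal{D}} S[\Phi(\rho)], \qquad (4)$$

and the Holevo capacity

$$C_{\rm Holv}(\Phi) = \sup_{\{\pi_j, \rho_j\}} \Big( S[\Phi(\rho_{\rm av})] - \sum_j \pi_j S[\Phi(\rho_j)] \Big), \qquad (5)$$

where $\rho_{\rm av} = \sum_j \pi_j \rho_j$ and the supremum is taken over all ensembles $\{\pi_j, \rho_j\}$ with $\rho_j \in \mathcal{D}$, $\pi_j > 0$ and $\sum_j \pi_j = 1$. Both $S_{\min}(\Phi)$ and $C_{\rm Holv}(\Phi)$ are conjectured to be additive over tensor products, i.e., to satisfy

$$S_{\min}(\Phi \otimes \Omega) = S_{\min}(\Phi) + S_{\min}(\Omega), \quad \text{and} \qquad (6)$$

$$C_{\rm Holv}(\Phi \otimes \Omega) = C_{\rm Holv}(\Phi) + C_{\rm Holv}(\Omega). \qquad (7)$$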

Shor showed [HI] that these conjectures (and several related ones) are equivalent in the global sense that both are either true for all general channels $\Phi : M_d \mapsto M_m$ or both are false. However, they are not necessarily equivalent for individual channels, and we will study them separately for the examples in this paper.

Shor also proved [SI] that both (6) and (7) hold for entanglement breaking (EB) channels. King ^H] gave an alternative proof based on multiplicativity of $\nu_p(\Phi)$. A CP map $\Phi$ is EB if $(I \otimes \Phi)(\rho)$ is separable for all input states $\rho$. A CPT map which is also EB is denoted as EBT. It was shown in [15] that a CP map is EB if all its Kraus operators can be chosen to have rank one, or if $(I \otimes \Phi)(|\beta\rangle\langle\beta|)$ is separable for some maximally entangled $|\beta\rangle$. Any EBT channel can be written as

$$\Phi(\rho) = \sum_k \gamma_k \, \mathrm{Tr}\,\rho E_k, \qquad (8)$$

with $\{E_k\}$ a POVM, and each $\gamma_k \in \mathcal{D}$. When $\{|e_k\rangle\}$ is an orthonormal basis for $\mathbf{C}^d$ and $E_k = |e_k\rangle\langle e_k|$ the channel is called CQ (classical-quantum); and when each $\gamma_k = |e_k\rangle\langle e_k|$ it is called QC (quantum-classical).

The following max-min characterizations of $C_{\rm Holv}(\Phi)$ in terms of the relative entropy $H(\rho, \gamma) = \mathrm{Tr}\,\rho(\log \rho - \log \gamma)$ are extremely useful. They were obtained
independently in and [28J. 

$$C_{\rm Holv}(\Phi) = \inf_{\gamma \in \mathcal{D}} \sup_{\omega \in \mathcal{D}} H[\Phi(\omega), \Phi(\gamma)] \qquad (9a)$$

$$= \sup_{\omega \in \mathcal{D}} H[\Phi(\omega), \Phi(\rho_{\rm av})] \qquad (9b)$$

$$= H[\Phi(\rho_j), \Phi(\rho_{\rm av})], \qquad (9c)$$

where $\rho_{\rm av}$ is the optimal average input and $\rho_j$ is any input in the optimal signal ensemble. It can be shown ^T] that (9b) and (9c) are equivalent to the statement that the points $(\rho_i, S(\rho_i))$ define a supporting hyperplane for the convex optimization problem (5).
2.2 Depolarizing channels

The properties of the depolarizing channel are well-known and can be summarized as follows.

Theorem 1 The depolarizing channel (1) satisfies

a) $\Gamma_a^{\rm dep}$ is unital, i.e., $\Gamma_a^{\rm dep}(I) = I$.

b) The output $\Gamma_a^{\rm dep}(|\psi\rangle\langle\psi|)$ for any pure state $|\psi\rangle\langle\psi|$ has eigenvalues $[a + \frac{1-a}{d}, \frac{1-a}{d}, \ldots, \frac{1-a}{d}]$.

c) For any CPT map $\Omega$, $\nu_p(\Gamma_a^{\rm dep} \otimes \Omega) = \nu_p(\Gamma_a^{\rm dep})\,\nu_p(\Omega)$ for all $p \ge 1$.

d) For any CPT map $\Omega$, $S_{\min}(\Gamma_a^{\rm dep} \otimes \Omega) = S_{\min}(\Gamma_a^{\rm dep}) + S_{\min}(\Omega)$.

e) $C_{\rm Holv}(\Gamma_a^{\rm dep}) = \log d - S_{\min}(\Gamma_a^{\rm dep})$.

f) The capacity $C_{\rm Holv}(\Gamma_a^{\rm dep})$ can be achieved using $d$ orthogonal input states.

g) The optimal average input is $\frac{1}{d}I$.

h) For any CPT map $\Omega$, $C_{\rm Holv}(\Gamma_a^{\rm dep} \otimes \Omega) = C_{\rm Holv}(\Gamma_a^{\rm dep}) + C_{\rm Holv}(\Omega)$.

i) When $a \le \frac{1}{d+1}$, the channel $\Gamma_a^{\rm dep}$ is EBT.

The multiplicativity (c) was proved by King J7j for any depolarizing map, including those with negative $a$; he also showed that properties (d) and (h) follow. Properties (d) and (h) were proved independently by Fujiwara and Hashizume [Hj for maps with $a > 0$ and $\Omega = \Gamma_a^{\rm dep}$; they used a majorization argument which also implies (c). Properties (a), (b) and (e) are well-known and easily verified. Property (i) can be verified by computing the Choi matrix $(I \otimes \Gamma_a^{\rm dep})(|\beta\rangle\langle\beta|)$ for a maximally entangled state $|\beta\rangle$ and using Theorem 4 of [To].
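Parts (b) and (e)-(g) of Theorem 1 are easy to check numerically. The following is a minimal sketch, assuming NumPy is available; the helper names `depolarize` and `entropy`, the dimension, and the value of `a` are illustrative choices and not part of the original text. Entropies are computed in bits.

```python
import numpy as np

def depolarize(rho, a):
    """Gamma_a^dep(rho) = a*rho + (1 - a) * Tr(rho) * I/d, as in Eq. (1)."""
    d = rho.shape[0]
    return a * rho + (1 - a) * np.trace(rho) * np.eye(d) / d

def entropy(rho):
    """von Neumann entropy in bits; numerically zero eigenvalues are skipped."""
    ev = np.linalg.eigvalsh(rho)
    ev = ev[ev > 1e-12]
    return float(-np.sum(ev * np.log2(ev)))

d, a = 3, 0.6  # illustrative parameters
rng = np.random.default_rng(0)
psi = rng.normal(size=d) + 1j * rng.normal(size=d)
psi /= np.linalg.norm(psi)
out = depolarize(np.outer(psi, psi.conj()), a)

# Part (b): the output spectrum of any pure state.
spectrum = np.sort(np.linalg.eigvalsh(out))[::-1]
expected = np.array([a + (1 - a) / d] + [(1 - a) / d] * (d - 1))

# Parts (e)-(g): d orthogonal basis states with equal weights 1/d.
basis = [np.outer(e, e) for e in np.eye(d)]
rho_av = sum(basis) / d
holevo = entropy(depolarize(rho_av, a)) - np.mean(
    [entropy(depolarize(b, a)) for b in basis]
)
s_min = entropy(out)  # every pure input gives the same output entropy
```

With these choices the Holevo quantity of the uniform orthogonal ensemble coincides with $\log d - S_{\min}$, in agreement with part (e).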

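It is useful to introduce the generalized Pauli operators $X_d$ and $Z_d$ defined on the standard basis so that $X_d|e_k\rangle = |e_{k+1}\rangle$ with the addition in the subscript taken mod $d$ and $Z_d|e_k\rangle = e^{2\pi i k/d}|e_k\rangle$. Then for any $d \times d$ matrix $A$,

$$\frac{1}{d} \sum_{m=0}^{d-1} \sum_{n=0}^{d-1} X_d^m Z_d^n A (Z_d^\dagger)^n (X_d^\dagger)^m = (\mathrm{Tr}\,A)\, I \qquad (10)$$

and

$$\Gamma_a^{\rm dep}(\rho) = \Big[a + \frac{1-a}{d^2}\Big] I \rho I + (1-a)\frac{1}{d^2} \sum_{(m,n) \neq (0,0)} X_d^m Z_d^n \rho (Z_d^\dagger)^n (X_d^\dagger)^m \qquad (11)$$

where the sum runs over $0 \le m, n \le d-1$ with the term $m = n = 0$ excluded.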
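The identity (10) and the weights in (11) can be verified directly. The sketch below assumes NumPy; the function names `weyl_operators` and `twirl` are hypothetical helpers introduced only for this illustration, and the basis is indexed from 0 to $d-1$.

```python
import numpy as np

def weyl_operators(d):
    """Return X_d (cyclic shift) and Z_d (phase) on the standard basis."""
    X = np.roll(np.eye(d), 1, axis=0)  # X|e_k> = |e_{k+1 mod d}>
    Z = np.diag(np.exp(2j * np.pi * np.arange(d) / d))
    return X, Z

def twirl(A):
    """(1/d) * sum_{m,n} X^m Z^n A (Z^dag)^n (X^dag)^m, the left side of (10)."""
    d = A.shape[0]
    X, Z = weyl_operators(d)
    total = np.zeros((d, d), dtype=complex)
    for m in range(d):
        for n in range(d):
            W = np.linalg.matrix_power(X, m) @ np.linalg.matrix_power(Z, n)
            total += W @ A @ W.conj().T
    return total / d

d, a = 3, 0.6  # illustrative parameters
rng = np.random.default_rng(0)
A = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
lhs_10 = twirl(A)
rhs_10 = np.trace(A) * np.eye(d)

# Weights of the d^2 unitary conjugations in (11): they form a convex combination.
w_identity = a + (1 - a) / d**2
w_other = (1 - a) / d**2
total_weight = w_identity + (d**2 - 1) * w_other
```

Since the weights sum to one, (11) exhibits the depolarizing channel as a convex combination of unitary conjugations whenever all weights are non-negative.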



Cortese [3] considered channels of the form

$$\Phi(\rho) = \sum_{m=0}^{d-1} \sum_{n=0}^{d-1} c_{mn} X_d^m Z_d^n \rho (Z_d^\dagger)^n (X_d^\dagger)^m \qquad (12)$$

with $c_{mn} \ge 0$ and $\sum_{mn} c_{mn} = 1$, and showed that

$$C_{\rm Holv}(\Phi) = \log d - S_{\min}(\Phi). \qquad (13)$$

A simplified proof of this result was given by Holevo [13], who showed that (13) holds for channels satisfying the covariance condition

$$\Phi(U_g \rho U_g^\dagger) = U'_g \Phi(\rho) U'^\dagger_g \quad \forall\, g \in G \qquad (14)$$

when $\{U_g\}$ and $\{U'_g\}$ are irreducible representations of a group $G$. The case (12) is called "Weyl covariance".

By using (10) to rewrite the second term in (2) and the fact that $\sum_k a_k = a$, one sees that such channels can be expressed as a convex combination of unitary conjugations. We write them in the form (2) because we exploit their relationship to the depolarizing channel. However, (13) need not hold for all channels of the form (2); in Section 5 we give examples which show that they can exhibit very different behavior.



2.3 Qubit channels 

As discussed in Appendix B, a unital qubit channel can be written (after rotation of bases) [22] as

$$\Phi(\rho) = \sum_{k=0}^{3} a_k \sigma_k \rho \sigma_k. \qquad (15)$$

It is also useful to recall that any qubit density matrix can be written as $\rho = \frac{1}{2}[I + w \cdot \sigma]$, where $\sigma$ denotes the vector of Pauli matrices and $w \in \mathbf{R}^3$; then the channel (15) can be written as

$$\Phi(\rho) = \frac{1}{2}\Big[I + \sum_{j=1}^{3} \lambda_j w_j \sigma_j\Big]. \qquad (16)$$

The relations between the parameters $\{a_k\}$ and $\{\lambda_j\}$ are discussed in Appendix B. The following theorem was proved by King in [To].
Theorem 2 Let $\Phi$ be a unital qubit channel and $a = \max_{k=1,2,3} |\lambda_k| = \max_{i \neq j \in \{0,1,2,3\}} [2(a_i + a_j) - 1]$. Then parts (c) to (h) of Theorem 1 hold, with $\Gamma_a^{\rm dep}$ replaced by $\Phi$. In addition, for those $k$ with $|\lambda_k| = a$, the inputs $\frac{1}{2}[I \pm \sigma_k]$ yield outputs with eigenvalues $\frac{1}{2}(1 \pm a)$ and, hence, have the same entropy as the corresponding qubit depolarizing channel.

This implies that all unital qubit channels for which the image ellipsoid of the Bloch sphere touches, but lies within, the sphere of radius $a$ (which is the image of a depolarizing channel) have the same capacity and minimal output entropy behavior. A unital qubit channel is EBT j2Sj if and only if $\sum_k |\lambda_k| \le 1$ or, equivalently, if $a_k \le \frac{1}{2}$ for all $k$.

A non-unital qubit channel can be written (after rotation of bases) [22] in the form

$$\Phi : \tfrac{1}{2}[I + w \cdot \sigma] \mapsto \tfrac{1}{2}\Big[I + \sum_{k=1}^{3} (t_k + \lambda_k w_k)\sigma_k\Big]. \qquad (17)$$

The conditions imposed on $t_k$ and $\lambda_k$ by the CPT requirement are given in [22] and summarized in 26J. (The special case $t_1 = t_2 = 0$ was considered earlier in [Zj.) One expects the generic behavior of non-unital qubit channels to be quite different from that of unital ones.

A) Non-unital qubit channels typically have a unique state of optimal output purity. This always holds when $t_k \neq 0$ in the direction for which the ellipsoid axis $|\lambda_k|$ is longest.

If $t_k \neq 0$ only in direction(s) orthogonal to the longest axis, then one typically has two non-orthogonal states of optimal output purity (although these can coalesce into one, as for extreme amplitude damping channels, and can come from orthogonal inputs for a CQ channel) |H| I2()j.

B) $C_{\rm Holv}(\Phi) < \log d - S_{\min}(\Phi)$ for all non-unital qubit maps.

C) In general, the capacity $C_{\rm Holv}(\Phi)$ cannot be achieved using $d$ orthogonal input states 0HHIH1I2E]-

There are, however, a number of exceptions. Two of these are CQ maps which take $\frac{1}{2}[I + w \cdot \sigma] \mapsto \frac{1}{2}[I + t_1\sigma_1 + \lambda_3 w_3 \sigma_3]$ and QC maps which take $\frac{1}{2}[I + w \cdot \sigma] \mapsto \frac{1}{2}[I + (t_3 + \lambda_3 w_3)\sigma_3]$. The QC channels are included in the larger class of channels for which $t_k \neq 0$ only for the largest $|\lambda_k|$; then $C_{\rm Holv}(\Phi)$ can be achieved with a pair of orthogonal inputs [3, 20].

D) Properties (c), (d), and (h) of Theorem 1 are conjectured to hold for non-unital qubit maps; however, a proof is known only for (c) in the case $p = 2$.
2.4 Some channels for d > 2

When $\Phi$ maps a larger space into qubit density matrices, it is possible to have $C_{\rm Holv}(\Phi) = \log d - S_{\min}(\Phi)$, even when the optimal input $\rho_{\rm av} \neq \frac{1}{d}I$. This is the case for Shor's extended channel in Section 9 of [SI]. In that case, the original map $\Phi$ is
extended to $ ex t for which the optimal average input is i? av = p min ® 4-7, with p min 
achieving S' m i n ($) for the original channel. Then $ ext (-R av ) = \l- Note that one 
also has $ ext (/rf ® i^ 2 ) = I d so that $ ex t is unital. Moreover, if S' m i n ($) is achieved 
for more than one state, then the optimal average input is not unique, although the 
optimal average output is unique. 

For qubits, a channel is unital if and only if it can be written as a convex combination of unitary conjugations [22]. It is well-known that this result does not extend to $d > 2$. One well-known example is the Werner-Holevo channel [33] for which the Kraus operators can be written as partial isometries. This example does satisfy (13) as well as (6) and (7), although it has only been shown to satisfy (20) when $1 \le p \le 2$ [2. and is known to violate (20) for large $p$.

For $d = 3$, Fuchs, et al jHj found a unital channel which satisfies (13) but for which the optimal inputs are not orthogonal. This channel is given by Eq. (19) of

The asymmetric examples in Section 5 appear to be the first for which a unital channel does not satisfy (13).

It is natural to look for classifications of unital channels which include a type whose behavior is similar to that of unital qubit channels. The results presented here show that there are channels which can be written as convex combinations of unitary conjugations which do not exhibit this behavior. Thus we are left with the conjecture that channels of the form (12) behave like unital qubit channels and, hence, satisfy (c) to (h) of Theorem 1 with $\Gamma$ replaced by $\Phi$, as in Theorem 2.



2.5 Majorization 

We will use the notation $[x_1, x_2, \ldots x_n] \succ [y_1, y_2, \ldots y_n]$ to indicate that both sets are non-negative and arranged in non-increasing order $x_1 \ge x_2 \ge x_3 \ldots \ge x_n$ and satisfy the majorization condition $\sum_{i=1}^{k} x_i \ge \sum_{i=1}^{k} y_i$ for $k = 1 \ldots n-1$ and $\sum_{i=1}^{n} x_i = \sum_{i=1}^{n} y_i$. It is well-known [Hj 12*3] that this implies

$$\sum_{i=1}^{n} x_i^p \ge \sum_{i=1}^{n} y_i^p \qquad (18)$$

for all $p \ge 1$. Therefore, whenever $\rho$ and $\gamma$ are density matrices for which the eigenvalues of $\rho$ majorize those of $\gamma$, $\|\rho\|_p \ge \|\gamma\|_p$ and $S(\rho) \le S(\gamma)$.

When only an inequality holds for $k = n$, we use the term submajorize, and observe that the same conclusions follow by extending both sets with $x_{n+1} = 0$ and $y_{n+1}$ chosen to give equality.



3 Results on minimal output purity 

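In this section we state and prove some theorems on the minimal output purity of certain subclasses of the channels defined by (2).

Theorem 3 Let $\Phi$ be a channel of the form (2) for which all of the unitary operators $V_k$ have a common eigenvector $|\psi\rangle$. Then for any CPT map $\Omega$

a) $\|\Phi(|\psi\rangle\langle\psi|)\|_p = \nu_p(\Phi) = \nu_p(\Gamma_a^{\rm dep}) \quad \forall\, p \ge 1 \qquad (19)$

b) $\nu_p(\Phi \otimes \Omega) = \nu_p(\Phi)\, \nu_p(\Omega) \quad \forall\, p \ge 1 \qquad (20)$

c) $S[\Phi(|\psi\rangle\langle\psi|)] = S_{\min}(\Phi) = S_{\min}(\Gamma_a^{\rm dep}) \qquad (21)$

d) $S_{\min}(\Phi \otimes \Omega) = S_{\min}(\Phi) + S_{\min}(\Omega) \qquad (22)$

Proof: First, observe that

$$\Phi(\rho) = \sum_k \frac{a_k}{a}\, V_k \Big[ a\rho + (1-a)(\mathrm{Tr}\,\rho)\, \tfrac{1}{d}I \Big] V_k^\dagger \qquad (23)$$

is a convex combination of conjugation with $V_k$ composed with the depolarizing channel. Therefore, for any density matrix $\rho$

$$\|\Phi(\rho)\|_p \le \sum_k \frac{a_k}{a} \big\| V_k \Gamma_a^{\rm dep}(\rho) V_k^\dagger \big\|_p = \sum_k \frac{a_k}{a} \big\| \Gamma_a^{\rm dep}(\rho) \big\|_p \le \nu_p(\Gamma_a^{\rm dep}). \qquad (24)$$

Now consider $\rho = |\psi\rangle\langle\psi|$ with $|\psi\rangle$ the common eigenvector of $V_k$. Then

$$\|\Phi(|\psi\rangle\langle\psi|)\|_p = \big\| a|\psi\rangle\langle\psi| + (1-a)\tfrac{1}{d}I \big\|_p = \big\| \Gamma_a^{\rm dep}(|\psi\rangle\langle\psi|) \big\|_p = \nu_p(\Gamma_a^{\rm dep}),$$

where we used part (b) of Theorem 1. Therefore, $\nu_p(\Phi)$ is at least as big as $\nu_p(\Gamma_a^{\rm dep})$. Combining this with (24) proves part (a).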
To prove (b), we proceed similarly, using (23) to see that

$$\|(\Phi \otimes \Omega)(\rho_{12})\|_p \le \sum_k \frac{a_k}{a} \big\| (\Gamma_a^{\rm dep} \otimes \Omega)(\rho_{12}) \big\|_p \qquad (25)$$

$$\le \sum_k \frac{a_k}{a}\, \nu_p(\Gamma_a^{\rm dep})\, \nu_p(\Omega) \qquad (26)$$

$$= \nu_p(\Gamma_a^{\rm dep})\, \nu_p(\Omega) = \nu_p(\Phi)\, \nu_p(\Omega) \qquad (27)$$

where the last step used part (a). Since we can achieve $\nu_p(\Phi)\nu_p(\Omega)$ using a product state, this proves (b). Parts (c) and (d) then follow by the established technique 3\ of taking the right derivative at $p = 1$. QED

By choosing all $V_k = W^k$ with $W$ a unitary matrix which generates a cyclic group of order $d$, one can construct channels with precisely $d$ input states whose outputs have optimal purity. Additional channels with $d$ states of optimal output purity are discussed in Section 4. Channels for which each $V_k$ has the form $\sum_{j=1}^{m} |f_j\rangle\langle f_j| \oplus W_k$ with $|f_j\rangle$ a set of $m$ mutually orthonormal vectors and $W_k$ unitary operators on $[\mathrm{span}\{|f_j\rangle\}]^\perp$ are more interesting. Several classes of examples are discussed in detail in Section 5. When the $W_k$ have no common eigenvectors, it follows from Theorem 4 below that these channels have precisely $m$ mutually orthogonal states of optimal purity. One can construct channels with $m = 1, 2, \ldots d-2$; however, if the $V_k$ have $d-1$ common eigenvectors, then they have $d$ common eigenvectors, precluding the possibility that $m = d-1$.

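Theorem 4 Let $\Phi$ be a channel of the form (2) and let $\rho$ be any density matrix other than the projection onto a common pure state eigenvector of all $V_k$. Then $\|\Phi(\rho)\|_p < \nu_p(\Gamma_a^{\rm dep})$ and $S[\Phi(\rho)] > S_{\min}(\Gamma_a^{\rm dep})$.

Proof: Under the hypothesis of the theorem,

$$\Big\| \sum_k \frac{a_k}{a} V_k \rho V_k^\dagger \Big\|_\infty < 1 \qquad (28)$$

and one can write the eigenvalues of $\sum_k \frac{a_k}{a} V_k \rho V_k^\dagger$ as $[x_1, x_2, \ldots x_d]$ with $x_1 < 1$. Then the eigenvalues of $\Phi(\rho)$ are

$$\Big[a x_1 + \tfrac{1-a}{d},\ a x_2 + \tfrac{1-a}{d},\ \ldots\ a x_d + \tfrac{1-a}{d}\Big] \prec \Big[a + \tfrac{1-a}{d},\ \tfrac{1-a}{d},\ \ldots\ \tfrac{1-a}{d}\Big]. \qquad (29)$$

Thus, the eigenvalues of $\Phi(\rho)$ are majorized by those of $\Gamma_a^{\rm dep}(|\psi\rangle\langle\psi|)$ for any pure input $|\psi\rangle$. QED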
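Theorems 3 and 4 can be illustrated on a small example. The sketch below assumes NumPy and uses a qutrit channel of the form (2) with $V_k = 1 \oplus \sigma_k$, so that $|e_0\rangle$ is the only common eigenvector; the weights in `WEIGHTS` and the helper names are illustrative assumptions, not values taken from the paper.

```python
import numpy as np

PAULIS = [
    np.eye(2, dtype=complex),
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]]),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
WEIGHTS = [0.4, 0.15, 0.1, 0.05]  # illustrative a_k, with a = sum = 0.7

def channel(rho):
    """Phi(rho) = sum_k a_k V_k rho V_k^dag + (1-a) Tr(rho) I/3, V_k = 1 (+) sigma_k."""
    a = sum(WEIGHTS)
    out = (1 - a) * np.trace(rho) * np.eye(3, dtype=complex) / 3
    for a_k, sigma in zip(WEIGHTS, PAULIS):
        V = np.zeros((3, 3), dtype=complex)
        V[0, 0] = 1
        V[1:, 1:] = sigma
        out = out + a_k * V @ rho @ V.conj().T
    return out

def pure(v):
    """Projector onto the normalized vector v."""
    v = np.asarray(v, dtype=complex)
    v = v / np.linalg.norm(v)
    return np.outer(v, v.conj())

a = sum(WEIGHTS)
nu_inf = a + (1 - a) / 3  # largest output eigenvalue of the depolarizing channel

# Theorem 3: the common eigenvector reproduces the depolarizing spectrum.
spec_e0 = np.sort(np.linalg.eigvalsh(channel(pure([1, 0, 0]))))[::-1]

# Theorem 4: other inputs give strictly smaller largest eigenvalue.
rng = np.random.default_rng(3)
max_rand = max(
    np.linalg.eigvalsh(channel(pure(rng.normal(size=3) + 1j * rng.normal(size=3))))[-1]
    for _ in range(20)
)
```

Only the largest eigenvalue is compared here; by (29) the full output spectrum of any other input is majorized by that of $|e_0\rangle\langle e_0|$.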

Theorem 5 Let $\Phi$ be a channel of the form (2) for which the unitary operators $V_k$ have precisely $m$ mutually orthogonal common eigenvectors with $m < d$. Then $\rho_{\rm av} \neq \frac{1}{d}I$ and at least $(d-m)$ states in the optimal input ensemble have $S[\Phi(\rho_i)] > S_{\min}(\Phi)$.

Proof: When the number of common eigenvectors $m < d$, it follows that one cannot find a set of $d$ mutually orthogonal pure inputs $\rho_i$ for which $S[\Phi(\rho_i)] = S_{\min}(\Phi)$. Therefore, one cannot find an input ensemble such that both $\sum_i \pi_i \rho_i = \frac{1}{d}I$ and $S[\Phi(\rho_i)] = S_{\min}(\Phi)\ \forall\, i$ hold. Therefore, we must have

$$C_{\rm Holv}(\Phi) < \log d - S_{\min}(\Phi). \qquad (30)$$

Since

$$\sup_{\omega} H[\Phi(\omega), \Phi(\tfrac{1}{d}I)] = \log d - \inf_{\omega} S[\Phi(\omega)] = \log d - S_{\min}(\Phi), \qquad (31)$$

it follows from (31) and (9b), together with (30), that $\frac{1}{d}I$ is not the optimal average input.

If we know that the optimal signal ensemble has at least $d$ inputs, then at least $d - m$ of them must satisfy $S[\Phi(\rho_i)] > S_{\min}(\Phi)$. QED

Although we are primarily interested in channels which are trace-preserving, multiplicativity results, e.g., (20) can often be proved using only the CP condition. Moreover, Audenaert and Braunstein pp showed that multiplicativity of a special class of CP maps would imply superadditivity of entanglement of formation. Therefore, we notice that a weaker version of Theorem 3 can be extended to maps of the form (23) in which the $V_k$ are contractions rather than unitary, i.e. $V_k V_k^\dagger \le I$.



Theorem 6 Let $\Phi$ be a CP map of the form

$$\Phi(\rho) = \sum_k \frac{a_k}{a}\, V_k \Big[ a\rho + (1-a)(\mathrm{Tr}\,\rho)\, \tfrac{1}{d}I \Big] V_k^\dagger \qquad (32)$$

for which all of the operators $V_k$ are contractions with a common eigenvector $|\psi\rangle$ satisfying $V_k|\psi\rangle = e^{i\theta_k}|\psi\rangle$. Then for any CP map $\Omega$, (19) - (20) hold.

Proof: The assumption that the eigenvalues of the common eigenvector have $|e^{i\theta_k}| = 1$ implies that $\nu_p(\Phi)$ is at least as large as $\nu_p(\Gamma_a^{\rm dep})$. For any contraction $V$, the eigenvalues of $VAV^\dagger$ are submajorized by those of $A$, which we write as $[\alpha_1, \alpha_2 \ldots \alpha_d]$. To see this, write $A = U A_D U^\dagger$ with $U$ unitary and $A_D$ the diagonal matrix with elements $\delta_{jk}\alpha_j$. Then $X = VU$ is also a contraction and the diagonal elements of $VAV^\dagger$ are $\sum_j |x_{ij}|^2 \alpha_j$ which are submajorized by $[\alpha_1, \alpha_2 \ldots \alpha_d]$. By applying this to $A = a\rho + (1-a)\frac{1}{d}I$, the result follows by the same argument as before.
4 Diagonal $V_k$

Before discussing several types of asymmetric channels, we consider channels for which all $V_k$ are simultaneously diagonal, as well as unitary. This includes the case $V_k = W^k$, with $W^d = I$, mentioned earlier. In all these situations, one has precisely $d$ states of minimal output entropy and the capacity is

$$C_{\rm Holv}(\Phi) = \log d - S_{\min}(\Phi) = \log d - S_{\min}(\Gamma_a^{\rm dep}). \qquad (33)$$

It then follows from the additivity of $S_{\min}(\Phi)$ in part (d) of Theorem 3 that $C_{\rm Holv}(\Phi)$ is also additive in the sense $C_{\rm Holv}(\Phi \otimes \Phi) = 2\,C_{\rm Holv}(\Phi)$.

The channels considered in this section are, therefore, convex combinations

$$\Phi(\rho) = a\, \Phi^{\rm diag}(\rho) + (1-a)(\mathrm{Tr}\,\rho)\, \tfrac{1}{d}I \qquad (34)$$

of the completely noisy map and a "diagonal channel" of the form $\Phi^{\rm diag}(\gamma) = \sum_k \alpha_k V_k \gamma V_k^\dagger$ with $\alpha_k > 0$ (so that $\alpha_k = a_k/a$ in the notation of (2)). The term diagonal channel was introduced by King [T§] for CP maps whose Kraus operators are simultaneously diagonal. King also showed that $\Phi^{\rm diag}(\gamma) = B * \gamma$ where $*$ denotes the Hadamard product, $B$ is a positive semi-definite matrix, and $\gamma$ is written in the basis in which the $V_k$ are diagonal. When $V_k$ is unitary, its diagonal elements can be written as $e^{i\theta_{km}}$, $m = 1, 2 \ldots d$ and $b_{mn} = \sum_k \alpha_k e^{i(\theta_{km} - \theta_{kn})}$. If one also requires $\Phi^{\rm diag}$ to be trace-preserving, then $\sum_k \alpha_k = 1$ and $b_{mm} = 1\ \forall\, m$. This implies that the states $|m\rangle\langle m|$ are fixed points of $\Phi^{\rm diag}$ so that it has $d$ pure state outputs. Hence additivity of both minimal output entropy and Holevo capacity hold trivially for diagonal CPT maps.

In the examples (34) considered here, the corresponding outputs are $\Phi(|m\rangle\langle m|) = a|m\rangle\langle m| + (1-a)\frac{1}{d}I$, $m = 1, 2, \ldots d$ which yield $d$ states of minimal output entropy. As noted above, this implies that they satisfy (13) and (7) when $\Omega = \Phi$. Since Theorem 3 holds, (19) - (22) are also satisfied.
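The Hadamard-product form of a diagonal channel can be checked numerically. The following sketch assumes NumPy; the dimension, the number of Kraus terms, and the randomly drawn weights and phases are illustrative assumptions only.

```python
import numpy as np

rng = np.random.default_rng(1)
d, K = 4, 3                                   # illustrative sizes
alpha = rng.random(K)
alpha /= alpha.sum()                          # trace-preserving: sum_k alpha_k = 1
theta = rng.uniform(0, 2 * np.pi, size=(K, d))
V = [np.diag(np.exp(1j * theta[k])) for k in range(K)]  # diagonal unitaries

# b_mn = sum_k alpha_k exp(i(theta_km - theta_kn))
B = sum(
    alpha[k] * np.outer(np.exp(1j * theta[k]), np.exp(-1j * theta[k]))
    for k in range(K)
)

def diag_channel(gamma):
    """Phi^diag(gamma) = sum_k alpha_k V_k gamma V_k^dag."""
    return sum(alpha[k] * V[k] @ gamma @ V[k].conj().T for k in range(K))

# A random density matrix written in the basis where the V_k are diagonal.
G = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
gamma = G @ G.conj().T
gamma /= np.trace(gamma).real

kraus_form = diag_channel(gamma)
hadamard_form = B * gamma                     # entrywise (Hadamard) product
fixed_point = diag_channel(np.diag([1.0, 0, 0, 0]).astype(complex))
```

Because the diagonal of `B` is identically one, every basis projector $|m\rangle\langle m|$ is left unchanged, which is the fixed-point property used in the text.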

The depolarizing channel satisfies the general covariance condition $\Gamma(U\rho U^\dagger) = U\Gamma(\rho)U^\dagger$ for arbitrary unitary matrices $U$, but this does not extend to channels of the form (2). However, when $V_k = W^k$ with $W = UX_dU^\dagger$ and $U$ unitary, the channel satisfies the weaker condition (14) using the generalized Pauli matrices $UX_d^m Z_d^n U^\dagger$.

Note that $W = UX_dU^\dagger$ is equivalent to the assumption that $W$ has eigenvalues $e^{i2\pi m/d}$, $m = 0, 1 \ldots d-1$. However, one can have a unitary $W$ with $W^d = I$, $W^m \neq I$, $m < d$ but $W \neq UX_dU^\dagger$. For example, with $d = 5$, choose $W$ to have eigenvalues
$$e^{i2\pi/5},\ e^{i2\pi/5},\ e^{i2\pi 3/5},\ 1,\ 1.$$

More generally, of course, one could choose $V_k$ with eigenvalues $e^{i\theta_{km}}$ without any rational relationship between eigenvalues for a single $V_k$ or between those for $V_j$ and $V_k$. Then (13) still holds, despite the absence of any obvious group for which (14) holds. However, we cannot completely exclude the possibility of a hidden group.
5 Asymmetric examples

5.1 Qutrit channels

We will now study in detail the case $d = 3$, with

$$V_k = e^{i\theta_k}|e_0\rangle\langle e_0| \oplus \sigma_k = \begin{pmatrix} e^{i\theta_k} & 0 \\ 0 & \sigma_k \end{pmatrix}, \quad k \in \{0, 1, 2, 3\} \qquad (35)$$

with the convention that $\sigma_0 = I$. As discussed in Appendix B, we can assume that

do > Ox- 

It follows from Theorems 3 and 4 that $\Phi$ has exactly one state of minimal output entropy $|e_0\rangle\langle e_0|$ and two orthogonal states $|e_\pm\rangle\langle e_\pm| = \frac{1}{2}[I \pm \sigma_1]$ whose outputs have
eigenvalues [a 1 ^ + ±=^, a±^ + i=^]. Here Ai is given by with i = 1. If 
these states are the optimal inputs pj, symmetry implies that the optimal average 
input has the form 

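$$\rho_{\rm av} = (1 - 2x)|e_0\rangle\langle e_0| + x\,|e_+\rangle\langle e_+| + x\,|e_-\rangle\langle e_-|, \qquad (36)$$

for which the optimal average output is

$$\Phi(\rho_{\rm av}) = \Big(a(1-2x) + \tfrac{1-a}{3}\Big)|e_0\rangle\langle e_0| + \Big(ax + \tfrac{1-a}{3}\Big)\big(|e_+\rangle\langle e_+| + |e_-\rangle\langle e_-|\big). \qquad (37)$$

We want to optimize the capacity

$$S[\Phi(\rho_{\rm av})(x)] - \Big[(1-2x)\,S[\Phi(\rho_0)] + x\,S[\Phi(\rho_{+1})] + x\,S[\Phi(\rho_{-1})]\Big]. \qquad (38)$$

Since $S[\Phi(\rho_{+1})] = S[\Phi(\rho_{-1})]$, differentiating (38) gives the condition

$$2a \log\Big(\tfrac{1+2a}{3} - 2ax\Big) - 2a \log\Big(\tfrac{1-a}{3} + ax\Big) = -2S[\Phi(\rho_0)] + 2S[\Phi(\rho_{\pm 1})] \qquad (39)$$

or

$$\log \frac{1 - a + 3ax}{1 + 2a - 6ax} = -\frac{1}{a}\Delta S \qquad (40)$$

where $\Delta S = S[\Phi(\rho_{+1})] - S[\Phi(\rho_0)] > 0$. This has the solution

$$x = \frac{(1 + 2a)\,2^{-\Delta S/a} - (1 - a)}{3a\,(1 + 2 \cdot 2^{-\Delta S/a})}. \qquad (41)$$

It is easy to verify that $x < \frac{1}{3}$, confirming the intuition that the optimal input will be shifted toward the state $|e_0\rangle$.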

Let $\rho_x$ denote the average for the ensemble corresponding to the optimal $x$ (41) and $C^x_{\rm Holv}(\Phi)$ the corresponding capacity (38). To show that $\rho_x$ is the true optimal average which yields $C_{\rm Holv}(\Phi)$, we need to verify that $H[\Phi(\omega), \Phi(\rho_x)] \le C^x_{\rm Holv}(\Phi)$ for all choices of $\omega$. This has been done numerically for a large range of $a$ and $\lambda_1$.
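The weight $x$ in (41) and the candidate capacity (38) are simple to evaluate once the two output entropies are known. The sketch below uses plain Python with entropies in bits. It assumes illustrative weights $a_k$, and it writes the output spectrum of $\frac{1}{2}[I \pm \sigma_1]$ in terms of the quantity `c = a0 + a1 - a2 - a3`, which is a name introduced here for the computation and is not notation from the paper.

```python
import math

def H(probs):
    """Entropy in bits of a probability vector."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def qutrit_optimum(weights):
    """Return (a, S0, S1, x) with x computed from Eq. (41)."""
    a0, a1, a2, a3 = weights
    a = a0 + a1 + a2 + a3
    c = a0 + a1 - a2 - a3          # contraction of the sigma_1 direction
    noise = (1 - a) / 3
    S0 = H([a + noise, noise, noise])                           # input |e_0>
    S1 = H([(a + c) / 2 + noise, (a - c) / 2 + noise, noise])   # inputs |e_+->
    q = 2 ** (-(S1 - S0) / a)
    x = ((1 + 2 * a) * q - (1 - a)) / (3 * a * (1 + 2 * q))
    return a, S0, S1, x

def candidate_capacity(a, S0, S1, x):
    """Eq. (38) evaluated with the average output spectrum of Eq. (37)."""
    noise = (1 - a) / 3
    average_output = [a * (1 - 2 * x) + noise, a * x + noise, a * x + noise]
    return H(average_output) - ((1 - 2 * x) * S0 + 2 * x * S1)

a, S0, S1, x = qutrit_optimum([0.4, 0.15, 0.1, 0.05])  # illustrative weights
cap = candidate_capacity(a, S0, S1, x)
```

Since (38) is concave in $x$, the value from (41) should be a maximum, and the resulting candidate capacity should lie strictly below $\log 3 - S_{\min}(\Phi)$, as Theorem 5 requires.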
5.2 Doubly depolarizing channels

We introduce some notation. Let $\{|e_j\rangle\}$ be an orthonormal basis for $\mathbf{C}^d$, $E_m$ the projection on $\mathrm{span}\{|e_1\rangle, |e_2\rangle \ldots |e_m\rangle\}$, and $E_m^\perp$ the projection on the orthogonal complement $\mathrm{span}\{|e_{m+1}\rangle, |e_{m+2}\rangle \ldots |e_d\rangle\}$.

Now suppose that $\Phi$ is a channel of the form (2) in which each $V_k$ has the form

$$V_k = E_m \oplus W_k = \begin{pmatrix} I_m & 0 \\ 0 & W_k \end{pmatrix}$$

where the $W_k$ are chosen to be unitary $(d-m) \times (d-m)$ matrices such that on $E_m^\perp \mathcal{H}$

$$\sum_k \frac{a_k}{a}\, W_k \rho W_k^\dagger = b\rho + (1-b)\,(\mathrm{Tr}\,E_m^\perp \rho)\, \frac{1}{d-m} E_m^\perp. \qquad (42)$$

It suffices to choose $W_k$ to be the generalized Pauli matrices defined before (10) and let $a_k = a(1-b)/(d-m)^2$ for all $k$ except $a_0 = a[b(d-m)^2 + (1-b)]/(d-m)^2$. For the case $d = 4$ and $m = 2$, this reduces to $W_k = \sigma_k$ with $a_0 = a(3b+1)/4$ and $a_j = a(1-b)/4$ for $j = 1, 2, 3$.

The action of $\Phi$ is similar to a depolarizing channel when restricted to $E_m\mathcal{H}$, or $E_m^\perp\mathcal{H}$. More precisely,

$$\Phi(|e\rangle\langle e|) = a|e\rangle\langle e| + (1-a)\tfrac{1}{d}I \quad \forall\ |e\rangle \in E_m\mathcal{H} \qquad (43)$$

$$\Phi(|f\rangle\langle f|) = ab|f\rangle\langle f| + a(1-b)\tfrac{1}{d-m}E_m^\perp + (1-a)\tfrac{1}{d}I \quad \forall\ |f\rangle \in E_m^\perp\mathcal{H} \qquad (44)$$

The case $m = 1$, $d = 3$ is a special case of the channels in the preceding section.

We expect that capacity can be achieved by a (non-unique) ensemble with $d$ inputs consisting of $m$ orthogonal vectors in $E_m\mathcal{H}$ and $d - m$ orthogonal vectors in $E_m^\perp\mathcal{H}$. (There is no loss of generality in assuming that the optimal inputs can be written as $\rho_j = |e_j\rangle\langle e_j|$.) By symmetry the probabilities for such an optimal ensemble are $\pi_j = t$ for $j \le m$ and $\pi_j = t^\perp$ for $j > m$, with $mt + (d-m)t^\perp = 1$. Thus $\rho_{\rm av} = tE_m + t^\perp E_m^\perp$ and

$$\Phi(\rho_{\rm av}) = at\,E_m + at^\perp E_m^\perp + (1-a)\tfrac{1}{d}I, \qquad (45)$$

so that $C_{\rm Holv}(\Phi)$ is the result of optimizing

$$S(\Phi(\rho_{\rm av})) - mt\,S[\Phi(|e_1\rangle\langle e_1|)] - (d-m)\,t^\perp S[\Phi(|e_d\rangle\langle e_d|)]. \qquad (46)$$

One finds that the optimal $t$ satisfies

$$a \log \frac{ad\,t^\perp + 1 - a}{ad\,t + 1 - a} = -\Delta S \qquad (47)$$

where $\Delta S = S[\Phi(|e_d\rangle\langle e_d|)] - S[\Phi(|e_1\rangle\langle e_1|)] > 0$. This implies that, as expected, the solution will have $t > \frac{1}{d} > t^\perp$. It also agrees with (41) when $d = 3$, $m = 1$ and $x = t^\perp$. When $d = 2m$, (47) has the solution

$$t^\perp = \frac{1}{ad}\, \frac{a\,(1 + 2^{-\Delta S/a}) - (1 - 2^{-\Delta S/a})}{1 + 2^{-\Delta S/a}}. \qquad (48)$$

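For $d = 2m$ the stationarity condition (47) and its closed-form solution can be checked with a short computation. This is a sketch in plain Python with entropies in bits; the function names and the parameter values $d = 4$, $a = 0.7$, $b = 0.6$ are illustrative assumptions. The output spectra are read off from (43) and (44).

```python
import math

def H(probs):
    """Entropy in bits of a probability vector."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def output_entropies(d, a, b):
    """Entropies of the outputs (43) and (44) for m = d/2."""
    m = d // 2
    noise = (1 - a) / d
    spread = a * (1 - b) / (d - m)
    S_e = H([a + noise] + [noise] * (d - 1))
    S_f = H([a * b + spread + noise] + [spread + noise] * (d - m - 1) + [noise] * m)
    return S_e, S_f

def optimal_weights(d, a, b):
    """Weights (t, t_perp) with m*t + (d-m)*t_perp = 1, from the d = 2m solution."""
    S_e, S_f = output_entropies(d, a, b)
    q = 2 ** (-(S_f - S_e) / a)
    t_perp = (a * (1 + q) - (1 - q)) / (a * d * (1 + q))
    return 2 / d - t_perp, t_perp

def candidate_capacity(d, a, b, t):
    """Eq. (46) as a function of t, using the average output (45)."""
    m = d // 2
    t_perp = (1 - m * t) / (d - m)
    noise = (1 - a) / d
    S_e, S_f = output_entropies(d, a, b)
    spectrum = [a * t + noise] * m + [a * t_perp + noise] * (d - m)
    return H(spectrum) - m * t * S_e - (d - m) * t_perp * S_f

d, a, b = 4, 0.7, 0.6                     # illustrative parameters
t, t_perp = optimal_weights(d, a, b)
S_e, S_f = output_entropies(d, a, b)
cap = candidate_capacity(d, a, b, t)
```

For these parameters the weights satisfy $t > \frac{1}{d} > t^\perp$, so the optimal ensemble favors the inputs with lower output entropy.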

5.3 Successively depolarizing channels 

The next example generalizes the qutrit case in a different way. We now choose $V_k = E_1 \oplus W_k$ with $m = 1$ so that



(49) 



Y^a k V kP Vl = a E lP E 1 © \ S^b k W k Ei pE^WU + (1 - b) (Tr E^p) ^E^ 
k L V fc / 

with b k = b. Equivalently, 

$(p) = aE l pE 1 + (50) 

+ J2^ k W k E^pE^Wl + a(l - b) (Tr E^p) ^E^ + (1 - a)(Trp)±/ 

k 

Proceeding in this way, we can inductively construct a channel with the property that the input states $|e_j\rangle\langle e_j|$ have strictly increasing output entropies, with each minimal when $\Phi$ is restricted to states on $E_{j-1}^\perp$, except that the last pair have equal entropy, i.e., $S[\Phi(|e_{d-1}\rangle\langle e_{d-1}|)] = S[\Phi(|e_d\rangle\langle e_d|)]$.

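We now make a change of notation so that $x_1 = \sum_k a_k$, $x_2 = \sum_k b_k$, etc. Then

$$\Phi : |e_1\rangle\langle e_1| \mapsto x_1|e_1\rangle\langle e_1| + \frac{1 - x_1}{d} I$$

$$|e_2\rangle\langle e_2| \mapsto x_1 x_2|e_2\rangle\langle e_2| + x_1\frac{1 - x_2}{d-1} E_1^\perp + \frac{1 - x_1}{d} I$$

$$|e_m\rangle\langle e_m| \mapsto \prod_{j=1}^{m} x_j\, |e_m\rangle\langle e_m| + \prod_{j=1}^{m-1} x_j\, \frac{1 - x_m}{d-m+1} E_{m-1}^\perp + \cdots + \frac{1 - x_1}{d} I$$
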

d-1 d-2 ^ 

led-iXe^l i-> IJz i |e (i _ 1 )(e (i _i| + J[xj ^ E d-i 

3=1 3=1 
3=1 
\ e d)(ed\ i-> Y\. x i( 1 ~ x d-i)\ e d){ e d\ + n x i~^T E d-i

3=1 3=1 
3=1 

5.4 Connection with CQ and classical channels 

For a channel $\Phi$ of the type considered in the preceding sections, define $g_{jk} = \langle e_j|\Phi(|e_k\rangle\langle e_k|)|e_j\rangle$ so that

$$\Phi(|e_k\rangle\langle e_k|) = \sum_j g_{jk}|e_j\rangle\langle e_j|. \qquad (51)$$

Explicit expressions for the channels in Sections 5.2 and 5.3 are given in Appendix P. The matrix $G$ is column stochastic, and the "successive" minimal entropy outputs are the same as for the CQ channel

$$\Phi_{\rm CQ}(\rho) = \sum_k \Big( \sum_j g_{jk}|e_j\rangle\langle e_j| \Big)\, \mathrm{Tr}\,\rho\,|e_k\rangle\langle e_k|. \qquad (52)$$

Under the assumption that the "successive" minimal entropy inputs form a set of optimal inputs for the Holevo capacity, the optimization problem for the weights in the input ensemble $\{\pi_m, |e_m\rangle\langle e_m|\}$ is the same as for the corresponding CQ channel. Moreover, the bistochastic matrix $G$ defines a classical channel acting on classical probability vectors in $\mathbf{R}^d$. The optimization problem for the Shannon capacity of this channel is the same as that for the Holevo capacity of the CQ channel (52).

We expect the behavior of the examples in the previous sections to be similar to that of a qubit channel of the form

$$\tfrac{1}{2}[I + w \cdot \sigma] \mapsto \tfrac{1}{2}\big[I + \lambda_1 w_1 \sigma_1 + \lambda_2 w_2 \sigma_2 + (t_3 + \lambda_3 w_3)\sigma_3\big] \qquad (53)$$

with $\lambda_3 > \lambda_2 = \lambda_1$ so that the image is a football and the only non-unital component is a translation along the longest axis. For such channels, it is well-known 1221 that the optimal inputs for the capacity $C_{\rm Holv}$ are the orthogonal states $\frac{1}{2}[I \pm \sigma_3]$, and the optimal weights are determined by the corresponding classical problem.

If the conjecture for the examples in the preceding sections (that the optimal inputs are orthogonal states which correspond to "successive" minimal entropy inputs) holds, then, although unital, they behave like the non-unital qubit channel above, i.e., they are closely related to a CQ and a classical problem with the same probability distribution for the optimal ensemble. This has been verified numerically for the qutrit channels of Section 5.1 and the double depolarizing channels of Section 5.2.
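The classical channel associated with a doubly depolarizing example can be written down explicitly from (43) and (44). The following plain-Python sketch builds the matrix $G$ and evaluates the classical mutual information for a given input distribution; the function names and the parameters $d = 4$, $m = 2$, $a = 0.7$, $b = 0.6$ are illustrative assumptions, and basis indices run from 0 to $d-1$.

```python
import math

def transition_matrix(d, m, a, b):
    """G[j][k] = <e_j| Phi(|e_k><e_k|) |e_j> for the doubly depolarizing channel."""
    noise = (1 - a) / d
    G = [[noise] * d for _ in range(d)]
    for k in range(d):
        if k < m:                      # input in E_m H, Eq. (43)
            G[k][k] += a
        else:                          # input in the complement, Eq. (44)
            G[k][k] += a * b
            for j in range(m, d):
                G[j][k] += a * (1 - b) / (d - m)
    return G

def entropy(p):
    """Shannon entropy in bits."""
    return -sum(t * math.log2(t) for t in p if t > 0)

def mutual_information(G, pi):
    """H(G pi) - sum_k pi_k H(column k), the classical analogue of (46)."""
    d = len(pi)
    output = [sum(G[j][k] * pi[k] for k in range(d)) for j in range(d)]
    columns = [[G[j][k] for j in range(d)] for k in range(d)]
    return entropy(output) - sum(pi[k] * entropy(columns[k]) for k in range(d))

G = transition_matrix(4, 2, 0.7, 0.6)
uniform = mutual_information(G, [0.25, 0.25, 0.25, 0.25])
shifted = mutual_information(G, [0.3, 0.3, 0.2, 0.2])
```

For these parameters, shifting weight toward the inputs in $E_m\mathcal{H}$ increases the mutual information relative to the uniform distribution, in line with the expectation $t > \frac{1}{d} > t^\perp$.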
6 Numerical determination of capacity

6.1 Description of the algorithms

Our numerical work is based on the following variant of the max-min principle (9a)-(9c)

$$C_{\rm Holv}(\Omega) \le \sup_{\omega} H[\Omega(\omega), \Omega(\gamma)] \qquad (54)$$

with equality if and only if $\Omega(\gamma) = \Omega(\rho_{\rm av})$. The equality condition follows from the argument in j2H] which implies that if $\Omega(\rho_{\rm av}) \neq \Omega(\gamma)$, then at least one of the inputs $\rho_j$ in an optimal signal ensemble must satisfy

$$H[\Omega(\rho_j), \Omega(\gamma)] \ge C_{\rm Holv}(\Omega) + H[\Omega(\rho_{\rm av}), \Omega(\gamma)] > C_{\rm Holv}(\Omega).$$

Note that this also implies that the optimal average output $\Omega(\rho_{\rm av})$ is unique, a fact which can be proven directly from the strict concavity of the entropy. This uniqueness is implicit in j2U] and stated and proved explicitly in 29j. It can happen (as in the first example of Section 2.4) that there is more than one optimal signal ensemble or optimal average input; however, the optimal average output of a channel is always unique.

Now suppose that we have a candidate for both the optimal average output $\Omega(\rho^*_{\rm av})$ and an associated candidate capacity $C^*_{\rm Holv}(\Omega)$.

a) If there is a state $\omega$ such that $C^*_{\rm Holv}(\Omega) < H[\Omega(\omega), \Omega(\rho^*_{\rm av})]$ we can conclude that the candidate is not the true optimal average.

b) If $C^*_{\rm Holv}(\Omega) = \sup_{\omega \in \mathcal{D}} H[\Omega(\omega), \Omega(\rho^*_{\rm av})]$ we can conclude that we have found the true optimal average and capacity, at least up to the accuracy of the numerical work. Moreover, the states $\omega$ which achieve this supremum are the optimal inputs for $\Omega$.

To find the supremum in (54), we used an algorithm based on an optimization principle of Shor (221 which is stated and proved as Theorem 7 in Appendix A. This algorithm finds relative, rather than absolute, maxima and is applied in situations in which some relative maxima are known (or expected) to satisfy (b) above. Therefore, for each channel tested, it is necessary to use it repeatedly with multiple inputs chosen to ensure that it will find a state satisfying (a) if one exists.
6.2 Numerical results

6.2.1 Single use of channel

We first tested our hypothesis that the "successive" minimal entropy states for the examples in Section 5 are optimal inputs for the Holevo capacity. If this hypothesis is correct, the weights for the optimal ensemble are given by the optimization problem of Section 5. Numerical tests were done only for the qutrit channels of Section 5.1 and the double depolarizing channels of Section 5.2 in the case $d = 4$, $m = 2$, with parameter choices similar to those tested for additivity.

For the qutrit case, $\Phi(\rho^*_{\rm av})$ and $C^*_{\rm Holv}(\Phi)$ are given by (37) and (38) respectively with $x$ given by (41). The parameters $a_k$ were chosen so that $a_0 > a/2$, and $a_0 \ge a_1 \ge a_2 \ge a_3$ with $a = 0.5, 0.52, 0.54, \ldots 0.9$ and for each of these $a_0 = a/2 + 0.05, a/2 + 0.1 \ldots$ until $a_0$ exceeds $a - 0.01$. For each of these pairs, we considered $a_j = (a - a_0)/3$ as well as a selection of parameters with $a_1 > a_2 > a_3$.

For the $d = 4$, $m = 2$ case, $\Phi(\rho^*_{\rm av})$ is given by (45) and $C^*_{\rm Holv}(\Phi)$ by (46) with $d = 4$, $m = 2$ and the weights determined by (48). All pairs of parameters $a$ and $b$ in the set $\{0.5, 0.55, 0.6, \ldots 0.9\}$ were tested.

The starting inputs used in Theorem 7 were chosen as follows. In both cases, for each set of parameters, 50 pure input states $|\psi\rangle\langle\psi|$ were obtained by normalizing the state $|\psi\rangle = \sum_{k=1}^{d} r_k|k\rangle$ where $|k\rangle$ denotes the standard basis for $\mathbf{C}^d$ and the complex coefficients $r_k$ were chosen randomly. In both cases, for all choices of parameters, $H[\Phi(\omega), \Phi(\rho^*_{\rm av})] \le C^*_{\rm Holv}(\Phi)$ to an accuracy of 10 significant figures.
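A much simplified version of this test can be sketched as follows. The code assumes NumPy and only samples random pure inputs; it does not implement the iterative algorithm of Theorem 7. It uses the qutrit channel of Section 5.1 with illustrative weights, the candidate average from (36) and (41), and entropies in bits. All function names and parameter values are assumptions made for this illustration.

```python
import numpy as np

PAULIS = [
    np.eye(2, dtype=complex),
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]]),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
WEIGHTS = [0.4, 0.15, 0.1, 0.05]   # illustrative a_k with a_0 > a/2
a = sum(WEIGHTS)

def channel(rho):
    """Qutrit channel of the form (2) with V_k = 1 (+) sigma_k."""
    out = (1 - a) * np.trace(rho) * np.eye(3, dtype=complex) / 3
    for a_k, sigma in zip(WEIGHTS, PAULIS):
        V = np.zeros((3, 3), dtype=complex)
        V[0, 0] = 1
        V[1:, 1:] = sigma
        out = out + a_k * V @ rho @ V.conj().T
    return out

def entropy(rho):
    ev = np.linalg.eigvalsh(rho)
    ev = ev[ev > 1e-12]
    return float(-np.sum(ev * np.log2(ev)))

def rel_entropy(P, Q):
    """H(P, Q) = Tr P (log P - log Q) in bits, for full-rank Q."""
    w, U = np.linalg.eigh(Q)
    log_Q = (U * np.log2(w)) @ U.conj().T
    return float(-entropy(P) - np.real(np.trace(P @ log_Q)))

def pure(v):
    v = np.asarray(v, dtype=complex)
    v = v / np.linalg.norm(v)
    return np.outer(v, v.conj())

e0, ep, em = pure([1, 0, 0]), pure([0, 1, 1]), pure([0, 1, -1])
S0, S1 = entropy(channel(e0)), entropy(channel(ep))
q = 2 ** (-(S1 - S0) / a)
x = ((1 + 2 * a) * q - (1 - a)) / (3 * a * (1 + 2 * q))      # Eq. (41)
out_x = channel((1 - 2 * x) * e0 + x * ep + x * em)           # Eq. (37)
cap = entropy(out_x) - ((1 - 2 * x) * S0 + 2 * x * S1)        # Eq. (38)

# 50 random pure inputs, obtained by normalizing random complex coefficients.
rng = np.random.default_rng(7)
worst = max(
    rel_entropy(channel(pure(rng.normal(size=3) + 1j * rng.normal(size=3))), out_x)
    for _ in range(50)
)
```

In this sketch the three candidate inputs attain the candidate capacity, as (9c) requires, and none of the sampled states exceeds it. Random sampling alone cannot establish the supremum, which is why the paper uses an iterative search.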

6.2.2 Additivity 

We tested additivity of $C_{Holv}(\Phi \otimes \Phi)$ for the channels of Section 5.1 and those of
Section 5.2 with $d = 4$, $m = 2$. In both cases, $\Omega(\rho_{av}) = \Phi(\rho_{av}) \otimes \Phi(\rho_{av})$ and
$C_{Holv}(\Omega) = 2C_{Holv}(\Phi)$, with $\rho_{av}$ and $C_{Holv}(\Phi)$ the expressions for a single use under
the assumption that successively orthogonal minimal entropy inputs are optimal for
the capacity. The assumption was tested numerically in the previous section. The
results of this section give further support for this conjecture; if it were not true, one
could find another pair of products with capacity greater than twice the $C_{Holv}(\Phi)$
from the previous section.

The algorithm in Theorem 7 always yields a sequence $\omega_k$ for which
$H[(\Phi \otimes \Phi)(\omega_k), \Phi(\rho_{av}) \otimes \Phi(\rho_{av})]$ is non-decreasing. Although the limiting state
$\omega$ is stationary in the sense of (61), the eigenvalue $\lambda$ need not equal the supremum
in (JS~4J). Indeed, when testing additivity, products of optimal inputs will always be
stationary states. Therefore, it is important to include starting points which do not
automatically converge to these stationary points if others exist.

In choosing the parameters for testing additivity, it is reasonable to exclude 
values for which some restriction of the channel is entanglement breaking (EBT). 
Thus, we focus on values well away from the EBT regions for the corresponding 
depolarizing channel, i.e., a < 0.25 for d = 3 and a < 0.2, b < | for d = 4 in 
Section 5.2. Similarly, for qutrits, we choose $a_0 > \frac{1}{2}a$. We do not claim that channels
with some EBT parameters are EBT or that we can prove additivity. However, it
would be quite extraordinary if a channel of the form (}4*3*|) with parameters in (or
near) the EBT regions were superadditive when those with larger values were not.

Because the double depolarizing examples offer possibilities for entanglement 
across regions in ways not previously tested numerically, we concentrated on this 
case. For d = 4, m = 2, we considered all pairs of parameters a, b in the set 
{0.5, 0.52, 0.54, ... 0.98}. For each pair, we used the following selection of input 
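states (which are described with the convention that $|k\rangle$ denotes the standard basis
in $\mathbf{C}^4$):

i) 10 random pure states $|\psi\rangle\langle\psi|$, where $|\psi\rangle$ is obtained by normalizing the state

$$|\tilde{\psi}\rangle = \sum_{j=1}^{4} \sum_{k=1}^{4} r_{jk} \, |j\rangle \otimes |k\rangle,$$

with complex coefficients $r_{jk}$ chosen randomly.

ii) 10 maximally entangled input states $|\psi\rangle\langle\psi|$, where

$$|\psi\rangle = c_1 |1\rangle \otimes |3\rangle + c_2 |2\rangle \otimes |4\rangle + c_3 |3\rangle \otimes |2\rangle + c_4 |4\rangle \otimes |1\rangle,$$

with $c_k = (1/2) \exp(i\theta_k)$ and $\theta_k$ chosen randomly in $[0, 2\pi]$.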

iii) 10 pure input states $|\psi\rangle\langle\psi|$, where $|\psi\rangle$ is obtained by normalizing the state

4 

\$) = X)^*)® 
i=i 

with each vector in the sum chosen randomly as in Section 6.2.1.
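The inputs of type (ii) can be sketched as below, together with a check that they are maximally entangled, in the sense that the reduced state on the first factor is $\frac{1}{4}I$. The function names, the seed, and the 0-based index convention are assumptions introduced for this illustration.

```python
import numpy as np

def entangled_input(rng, d=4):
    """State vector sum_k c_k |k> (x) |partner(k)> with random phases."""
    # 0-based version of the pairing 1->3, 2->4, 3->2, 4->1 used in (ii).
    partner = [2, 3, 1, 0]
    theta = rng.uniform(0.0, 2.0 * np.pi, size=d)
    c = 0.5 * np.exp(1j * theta)          # c_k = (1/2) exp(i theta_k)
    psi = np.zeros(d * d, dtype=complex)
    for k in range(d):
        psi[k * d + partner[k]] = c[k]    # coefficient of |k> (x) |partner[k]>
    return psi

def reduced_state(psi, d=4):
    """Partial trace of |psi><psi| over the second tensor factor."""
    m = psi.reshape(d, d)                 # m[j, k] multiplies |j> (x) |k>
    return m @ m.conj().T

rng = np.random.default_rng(0)            # hypothetical seed
psi = entangled_input(rng)
rho_first = reduced_state(psi)
```

Because the pairing is a permutation and every $|c_k| = 1/2$, the reduced state does not depend on the random phases.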

For $d = 3$, the same parameter values were used as in Section 6.2.1 with 30 random
input pure states chosen as described in (i) above.

In all the situations tested, $C_{Holv}(\Phi \otimes \Phi)$ agrees with $2C_{Holv}(\Phi)$ to 10 significant
figures.
7 Discussion



We have considered the effect of modifying a depolarizing channel by replacing $a\rho$,
the first term in (1), by different convex combinations of unitary conjugations. We
have shown that this leads to a rich variety of examples, some of which exhibit 
behavior previously associated with non-unital channels. Nevertheless, we prove a 
number of results, including the additivity of minimal output entropy. 

To relate our results to other recent work, let $M(\rho) = \sum_k x_k V_k \rho V_k^\dagger$ with $x_k = a_k/a$
and $a_k$ as in (2). Then the channel in (2) can be written as $\Phi = \Gamma^{dep} \circ M$, and Fukuda's
lemma ^U] can be applied to give an alternate proof of parts (b) and (d) of Theorem El.
When the $V_k$ have a common eigenvector, $M(\rho)$ has an output state of
rank one so that Fukuda's lemma can be applied to the composition of $M(\rho)$ with
other unitarily invariant channels as discussed in jTUj. In addition, the channel
$T(\rho) = \frac{1}{d-1}\left[(\mathrm{Tr}\,\rho) I - M(\rho)\right]$ has an output which is a multiple of a projection.
Therefore, the results of Wolf and Eisert [23] imply that additivity © and multiplicativity
(J2*U|) with $1 < p < 2$ hold for tensor products of channels $T(\rho)$ in the
"strong" sense defined in [33]. Channels $M(\rho)$ generated from diagonal $V_k$ as in Section 0]
were considered in [33]; however, using the $V_k$ from the asymmetric examples
of Section E3 to generate $T(\rho)$ via $M(\rho)$ gives new examples.
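As a consistency check on this decomposition, the short computation below verifies it under two assumptions that are not spelled out in the text as extracted: that $\Gamma^{dep}$ denotes the depolarizing channel (1) with parameter $a$, and that $x_k = a_k/a$, so that $\sum_k x_k = 1$ and $\mathrm{Tr}\,M(\rho) = \mathrm{Tr}\,\rho$. The second line checks that the prefactor $\frac{1}{d-1}$ is what makes $T$ trace-preserving.

```latex
\begin{align*}
(\Gamma^{dep} \circ M)(\rho)
  &= a\,M(\rho) + (1-a)\,\bigl(\mathrm{Tr}\,M(\rho)\bigr)\,\tfrac{1}{d} I
   = \sum_k a_k V_k \rho V_k^\dagger + (1-a)(\mathrm{Tr}\,\rho)\,\tfrac{1}{d} I , \\
\mathrm{Tr}\,T(\rho)
  &= \tfrac{1}{d-1}\bigl[ d\,\mathrm{Tr}\,\rho - \mathrm{Tr}\,M(\rho) \bigr]
   = \mathrm{Tr}\,\rho .
\end{align*}
```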

Instead of modifying the first term in (1), one could change the second to obtain
the channel

$$\Phi(\rho) = a\rho + (1 - a)(\mathrm{Tr}\,\rho)\gamma \tag{55}$$

with $\gamma$ a fixed density matrix. The simplest such example is the shifted depolarizing
channel $\gamma = \frac{1}{d}(1 - b) I + b |\psi\rangle\langle\psi|$, for which additivity (JBJ and multiplicativity (J2(J|)
for all $p > 1$ have now been proved by Fukuda [11]. However, the only results which
have been proved for the general channel (55) are multiplicativity in the case $p = 2$
[12] and higher integers [23]. Despite recent progress for special cases, resolving the
additivity conjectures remains a challenge.

Acknowledgment This work began when MBR was a participant in the program 
on Quantum Information at the Isaac Newton Institute at Cambridge University 
A Shor's optimization algorithm



Our numerical results use the following theorem due to Shor.

Theorem 7 Let $\Omega$ be a CPT map and $\widehat{\Omega}$ its adjoint with respect to the Hilbert-Schmidt
inner product. Let $\psi$ be the eigenvector corresponding to the largest eigenvalue
of $\widehat{\Omega}[\log \Omega(\rho) - \log A]$. Then $H[\Omega(|\psi\rangle\langle\psi|), A] \geq H[\Omega(\rho), A]$.

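Proof: The largest eigenvalue of $\widehat{\Omega}[\log \Omega(\rho) - \log A]$ is

\begin{align*}
\lambda &= \sup_{\psi} \, \langle \psi, \widehat{\Omega}[\log \Omega(\rho) - \log A] \, \psi \rangle \tag{56} \\
        &= \sup_{\psi} \, \mathrm{Tr}\, |\psi\rangle\langle\psi| \, \widehat{\Omega}[\log \Omega(\rho) - \log A] \tag{57}
\end{align*}

where the supremum is over vectors $\psi$ with $\|\psi\| = 1$. Let $\gamma = |\psi\rangle\langle\psi|$ for the vector
which attains this supremum. Then

\begin{align*}
\mathrm{Tr}\, \Omega(\gamma) [\log \Omega(\rho) - \log A] &= \mathrm{Tr}\, \gamma \, \widehat{\Omega}[\log \Omega(\rho) - \log A] \\
  &\geq \mathrm{Tr}\, \rho \, \widehat{\Omega}[\log \Omega(\rho) - \log A] \tag{58} \\
  &= H[\Omega(\rho), A]
\end{align*}

so that

\begin{align*}
& H[\Omega(\gamma), A] - H[\Omega(\rho), A] \tag{59} \\
& \quad = H[\Omega(\gamma), \Omega(\rho)] + \mathrm{Tr}\, \Omega(\gamma) [\log \Omega(\rho) - \log A] - H[\Omega(\rho), A] \\
& \quad \geq 0. \qquad \text{QED} \tag{60}
\end{align*}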

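Given a starting $\rho = |\psi_0\rangle\langle\psi_0|$, let $\gamma_1 = \gamma = |\psi_1\rangle\langle\psi_1|$ be given by the eigenvector
defined before (58), and inductively define $\gamma_{k+1} = |\psi_{k+1}\rangle\langle\psi_{k+1}|$ using the eigenvalue equation for
$\gamma_k$. This gives a sequence for which $H[\Omega(\gamma_k), \Omega(\rho)]$ increases to a stationary point
$\omega$ satisfying

$$\widehat{\Omega}[\log \Omega(\rho) - \log A] \, \omega = \lambda \omega. \tag{61}$$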
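The iteration can be sketched numerically as follows. This is a minimal sketch under stated assumptions, not the code used for the reported results: the channel is supplied through Kraus operators, the test channel is a hypothetical Pauli channel with weights `alpha`, the reference state is taken to be $A = \frac{1}{2}I$, all outputs are assumed to have full rank so that the matrix logarithm exists, and the quantity tracked is $H[\Omega(\gamma_k), A]$, which Theorem 7 guarantees cannot decrease from one step to the next.

```python
import numpy as np

def herm_log(P):
    """Matrix logarithm of a positive definite Hermitian matrix."""
    w, v = np.linalg.eigh(P)
    return (v * np.log(w)) @ v.conj().T

def channel(kraus, rho):
    """Omega(rho) = sum_k K rho K^dagger."""
    return sum(K @ rho @ K.conj().T for K in kraus)

def adjoint(kraus, X):
    """Hilbert-Schmidt adjoint: sum_k K^dagger X K."""
    return sum(K.conj().T @ X @ K for K in kraus)

def rel_entropy(P, Q):
    """H[P, Q] = Tr P (log P - log Q), assuming P and Q have full rank."""
    return float(np.real(np.trace(P @ (herm_log(P) - herm_log(Q)))))

def shor_step(kraus, rho, A):
    """Projector onto the top eigenvector of adjoint[log Omega(rho) - log A]."""
    X = adjoint(kraus, herm_log(channel(kraus, rho)) - herm_log(A))
    w, v = np.linalg.eigh((X + X.conj().T) / 2)  # symmetrize against round-off
    psi = v[:, -1]                               # eigh sorts eigenvalues ascending
    return np.outer(psi, psi.conj())

# Hypothetical test channel (an assumption, not one of the paper's runs).
alpha = [0.6, 0.2, 0.15, 0.05]
paulis = [np.eye(2), np.array([[0, 1], [1, 0]]),
          np.array([[0, -1j], [1j, 0]]), np.array([[1, 0], [0, -1]])]
kraus = [np.sqrt(a) * s.astype(complex) for a, s in zip(alpha, paulis)]
A = np.eye(2) / 2                                # reference state

rng = np.random.default_rng(1)                   # hypothetical seed
r = rng.normal(size=2) + 1j * rng.normal(size=2)
psi0 = r / np.linalg.norm(r)
gamma = np.outer(psi0, psi0.conj())              # random pure starting state

values = []
for _ in range(30):
    values.append(rel_entropy(channel(kraus, gamma), A))
    gamma = shor_step(kraus, gamma, A)
```

With $A = \frac{1}{2}I$ the tracked quantity equals $\log 2$ minus the output entropy, so for this unital test channel the iteration searches for an input of minimal output entropy. As noted in Section 6.2.2, the limit is only a stationary point, which is why several starting states are needed.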

B Qubit channel details 

It was shown in that any unital qubit channel can be written as 

$$\Phi(\rho) = V \Big[ \sum_{k=0}^{3} \alpha_k \sigma_k (U \rho U^\dagger) \sigma_k \Big] V^\dagger \tag{62}$$

with $U, V$ unitary, the $\alpha_k \geq 0$ with $\sum_k \alpha_k = 1$, $\sigma_0 = I$ and $\sigma_j$, $j = 1, 2, 3$ the three
Pauli matrices. There is no loss of generality in assuming that $\alpha_0 \geq \alpha_j$ ($j = 1, 2, 3$);
if, instead, $\alpha_j$ is largest, one can factor out $\sigma_j$ and rewrite $\Phi$ in the form (62) with
$V \to V\sigma_j$. Similarly, one can choose $U, V$ to correspond to rotations in $\mathbf{R}^3$ so that
$\alpha_1 \geq \alpha_j$ ($j = 2, 3$). Finally, since the only effect of $U, V$ is to make changes of basis
which have no effect on the minimal output entropy or the Holevo capacity, we can
assume that $U = V = I$. Thus, there is no loss of generality in assuming that $\Phi$
has the form (15) with $\alpha_0 \geq \alpha_1 \geq \alpha_j$, $j = 2, 3$. If, in addition, $\alpha_0 > \frac{1}{2}$, the channel
is not EBT [26]. Thus, we often assume that

$$\alpha_0 > \tfrac{1}{2} > \alpha_1 \geq \alpha_j, \quad j = 2, 3. \tag{63}$$

The parameters $\alpha_k$, $k = 0, 1, 2, 3$ and $\lambda_i$, $i = 1, 2, 3$, in (15) and (16) are related
by the conditions

\begin{align*}
1 &= \alpha_0 + \alpha_1 + \alpha_2 + \alpha_3 \tag{64} \\
\lambda_i &= \alpha_0 + \alpha_i - \alpha_j - \alpha_l = 2(\alpha_0 + \alpha_i) - 1 \tag{65}
\end{align*}

with the understanding that $i, j, l$ are distinct. Then the input states $\frac{1}{2}(I \pm \sigma_i)$ have
output states $\frac{1}{2}(I \pm \lambda_i \sigma_i)$ whose eigenvalues are

$$\tfrac{1}{2}(1 \pm \lambda_i) = \begin{cases} \alpha_0 + \alpha_i \\ \alpha_j + \alpha_l = 1 - \alpha_0 - \alpha_i. \end{cases} \tag{66}$$

The image of the Bloch sphere is an ellipsoid whose axes have lengths $|\lambda_j|$, $j = 1, 2, 3$
with the output states above at the ends of the axes. Under the order assumption
all $\lambda_j > 0$ and the states with optimal output purity satisfy (66) with $i = 1$.
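Relations (65) and (66) are easy to confirm numerically. The sketch below does so for a hypothetical set of weights `alpha` satisfying (63), taking $U = V = I$; the weights and the function name `pauli_channel` are assumptions introduced only for this check.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
SIGMA = [I2,
         np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]], dtype=complex),
         np.array([[1, 0], [0, -1]], dtype=complex)]

def pauli_channel(alpha, rho):
    """Phi(rho) = sum_k alpha_k sigma_k rho sigma_k (the case U = V = I)."""
    return sum(a * s @ rho @ s.conj().T for a, s in zip(alpha, SIGMA))

alpha = [0.6, 0.2, 0.15, 0.05]   # hypothetical weights obeying (63)
checks = []
for i in (1, 2, 3):
    lam = 2 * (alpha[0] + alpha[i]) - 1              # relation (65)
    out = pauli_channel(alpha, (I2 + SIGMA[i]) / 2)  # image of (I + sigma_i)/2
    expected = (I2 + lam * SIGMA[i]) / 2
    eigs = np.linalg.eigvalsh(out)                   # ascending order
    target = sorted([alpha[0] + alpha[i], 1 - alpha[0] - alpha[i]])  # (66)
    checks.append(np.allclose(out, expected) and np.allclose(eigs, target))
```

For these weights the largest multiplier is $\lambda_1$, consistent with the optimal output purity being attained for $i = 1$.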

In the discussion of Section 15.11 a.^ = ^ and one uses suitably modified forms 
of equations (jo^|) - (|6^j) . 



C CQ matrices 

For a channel $ of the type considered in 
given by 

l-q 

9jk = \ab+ a -^ + ^ 

d—m a 
a(l-b) _|_ l- a 
^ d—m d 



Section l5~2~l the matrix defined in (|5Tj) is 

j = k < m 

j ^ k, j < m or k < m , , 

- ^ 

j = A; > m 

j ^ k,j,k <m . 
For a channel of the type considered in Section 5.3 it is

1 — X\ 



9jk 



d 



i II' 



\'d-j+: 



9jk 

^d-l.d-l 



fc>l,J = l 

fc > j > 1 

k = j < d 

k<j 

k = j = d 



(68) 